Efficient Reward Functions for Adaptive Multi-rover Systems
نویسندگان
چکیده
This paper addresses how efficient reward methods can be applied to multiple agents co-evolving in noisy and changing environments, under communication limitations. This problem is approached by “factoring” a global reward over all agents into agent-specific rewards that have two key properties: 1) agents maximizing their agentspecific rewards will tend to maximize the global reward, 2) an agent’s action has a large influence over its agent-specific reward allowing it to evolve quickly. Agents using these agent-specific rewards are tested in episodic and non-episodic, continuous-space multi-rover environment where rovers evolve to maximize a global reward function over all rovers. The environments are dynamic (i.e. changes over time) and can be noisy and can restrict communication between agents . We show that a control policy evolved using these agent-specific rewards outperforms global reward methods by up to 400%. More notably, in the presence of a larger number of rovers or rovers with noisy and communication limited sensors, the proposed method outperforms global reward by a higher percentage than in noise-free conditions with a small number of rovers.
منابع مشابه
Distributed Fuzzy Adaptive Sliding Mode Formation for Nonlinear Multi-quadrotor Systems
This paper suggests a decentralized adaptive sliding mode formation procedure for affine nonlinear multi-quadrotor under a fixed directed topology wherever the followers are conquered by dynamical uncertainties. Compared with the previous studies which primarily concentrated on linear single-input single-output (SISO) agents or nonlinear agents with constant control gain, the proposed method is...
متن کاملAdaptive Consensus Control for a Class of Non-affine MIMO Strict-Feedback Multi-Agent Systems with Time Delay
In this paper, the design of a distributed adaptive controller for a class of unknown non-affine MIMO strict-feedback multi agent systems with time delay has been performed under a directed graph. The controller design is based on dynamic surface control method. In the design process, radial basis function neural networks (RBFNNs) were employed to approximate the unknown nonlinear functions. S...
متن کاملAdaptive Neural Network Method for Consensus Tracking of High-Order Mimo Nonlinear Multi-Agent Systems
This paper is concerned with the consensus tracking problem of high order MIMO nonlinear multi-agent systems. The agents must follow a leader node in presence of unknown dynamics and uncertain external disturbances. The communication network topology of agents is assumed to be a fixed undirected graph. A distributed adaptive control method is proposed to solve the consensus problem utilizing re...
متن کاملDeveloping Self-adaptive Melody Search Algorithm for Optimal Operation of Multi-reservoir Systems
Operation of multi-reservoir systems is known as complicated and often large-scale optimization problems. The problems, because of broad search space, nonlinear relationships, correlation of several variables, as well as problem uncertainty, are difficult requiring powerful algorithms with specific capabilities to be solved. In the present study a Self-adaptive version of Melody Search algorith...
متن کاملReliability and Sensitivity Analysis of Structures Using Adaptive Neuro-Fuzzy Systems
In this study, an efficient method based on Monte Carlo simulation, utilized with Adaptive Neuro-Fuzzy Inference System (ANFIS) is introduced for reliability analysis of structures. Monte Carlo Simulation is capable of solving a broad range of reliability problems. However, the amount of computational efforts that may involve is a draw back of such methods. ANFIS is capable of approximating str...
متن کامل